# High-Precision Quantization

Qwen3 is the latest generation of large language models in the Tongyi Qianwen series, offering a complete suite of dense models and Mixture of Experts (MoE) models. Based on large-scale training, Qwen3 has achieved breakthrough progress in reasoning capabilities, instruction following, agent functionalities, and multilingual support.

Large Language Model English

Qwen3 1.7B GGUF

Qwen3 is the latest version of the Tongyi Qianwen series of large language models, offering a range of dense and mixture of experts (MoE) models. Based on large-scale training, Qwen3 has achieved breakthrough progress in reasoning, instruction following, agent capabilities, and multilingual support.

Large Language Model English

Qwen Qwen2.5 VL 72B Instruct GGUF

A quantized version of the Qwen2.5-VL-72B-Instruct multimodal large language model for image-text-to-text tasks, available at quantization levels ranging from high precision to low memory use.

Image-to-Text English

Qwen Qwen2.5 VL 7B Instruct GGUF

A quantized version of Qwen2.5-VL-7B-Instruct, produced with llama.cpp, that supports multimodal tasks such as image-to-text conversion.

Image-to-Text English

Nvidia OpenCodeReasoning Nemotron 32B IOI GGUF

A quantized version of the NVIDIA OpenCodeReasoning-Nemotron-32B-IOI model, produced with llama.cpp, suitable for code reasoning tasks.

Large Language Model Supports Multiple Languages

Nomic Ai Nomic Embed Code GGUF

A quantized version of the nomic-ai/nomic-embed-code model, produced with llama.cpp's imatrix quantization, suitable for code embedding and feature extraction tasks.

Nomic Embed Code GGUF

The Nomic code embedding model is a top-tier code retrieval tool that supports multiple programming languages and excels in code retrieval tasks.

Gemma 3 27b Tools Q5 K M GGUF

This model is a GGUF format version converted from Gemma-3-27b-tools, suitable for local inference tasks.

Large Language Model

Mlabonne Gemma 3 4b It Abliterated GGUF

A quantized version of the mlabonne/gemma-3-4b-it-abliterated model, produced with llama.cpp's imatrix quantization, suitable for image-text-to-text tasks.

Mlabonne Gemma 3 27b It Abliterated GGUF

A quantized version of the mlabonne/gemma-3-27b-it-abliterated model, which is derived from Google's Gemma 3 27B. It is produced with llama.cpp, is available at multiple quantization levels, and is suitable for text generation tasks.

Large Language Model

Gemma 3 12b It GGUF

Gemma-3-12b-it is a large language model developed by Google, based on the transformer architecture, focusing on text generation tasks.

Large Language Model

Open R1 OlympicCoder 32B GGUF

A quantized version of OlympicCoder-32B, produced with llama.cpp's imatrix quantization, suitable for code generation tasks.

Large Language Model English

Qwq 32B Preview IdeaWhiz V1 GGUF

A 32B-parameter large language model quantized with llama.cpp, specializing in text generation for the chemistry, biology, climate, and medical fields.

Large Language Model English

QVQ 72B Preview AWQ

QVQ-72B-Preview is an experimental research model developed by the Qwen team, focusing on enhancing visual reasoning capabilities. This repository provides its AWQ 4-bit quantized version.

Transformers English

Qwen2 VL 2B Instruct GGUF

Qwen2-VL-2B-Instruct is a multimodal vision-language model that supports image-text generation tasks, based on the Qwen2 architecture with a parameter scale of 2B.

Image-to-Text English

Flan T5 Large Grammar Synthesis Gguf

A GGUF-format T5 model for grammar and spelling correction, available in high-precision quantized versions to preserve correction quality.

Large Language Model English

OPEN SOLAR KO 10.7B GGUF

This is a GGUF-format quantized version of the beomi/OPEN-SOLAR-KO-10.7B model, supporting 2-8 bit quantization levels, suitable for Korean and English text generation tasks.

Large Language Model Supports Multiple Languages
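
The 2-8 bit range quoted for OPEN-SOLAR-KO-10.7B can be turned into a rough size estimate. The sketch below is only a back-of-the-envelope calculation: it assumes the parameter count is the nominal 10.7B from the model name and that every weight is stored at exactly the stated bit width. The function name `approx_weight_size_gib` is hypothetical and is not part of llama.cpp or any other library. Real GGUF files are usually somewhat larger, because quantized blocks also store scale factors and the file carries metadata.

```python
def approx_weight_size_gib(n_params: float, bits_per_weight: float) -> float:
    """Rough weight-only size in GiB: parameters x bits / 8 bytes."""
    return n_params * bits_per_weight / 8 / 2**30


# Assumption: nominal parameter count taken from the model name.
N_PARAMS = 10.7e9

for bits in (2, 3, 4, 5, 6, 8):
    size = approx_weight_size_gib(N_PARAMS, bits)
    print(f"{bits}-bit: ~{size:.1f} GiB")
```

Under these assumptions, halving the bit width halves the estimated footprint, which is the trade-off between precision and memory that the entries in this list describe.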

© 2025 AIbase